Feature Selection for Case-Based Learning: A Cognitive Bias Approach

نویسنده

  • Claire Cardie
چکیده

Experimental research in psychology, psycholinguistics, and cognitive science has discovered and examined numerous psychological constraints on human information processing. The tendency to concentrate on (a) present, rather than missing cues, (b) the current focus of attention, and (c) recent information are three examples. Short term memory limitations provide a fourth constraint. This paper shows that psychological constraints such as these can be used effectively as domain-independent sources of bias to improve learning. We first show that cognitive biases can be automatically and explicitly encoded into a baseline training instance representation. We then investigate the related problems of cognitive bias interaction and cognitive bias selection, and compare two selection methods that make varying assumptions about the independence of individual component biases. Finally, the paper shows that performance of a nearest-neighbor case-based learning algorithm on a natural language task improves as more cognitive biases are explicitly encoded into the baseline instance representation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integrating case-based learning and cognitive biases for machine learning of natural language

This paper shows that psychological constraints on human information processing can be used effectively to guide feature set selection for case-based learning of linguistic knowledge. Given as input a baseline case representation for a natural language learning task, our algorithm selects the relevant cognitive biases for the task and then automatically modifies the representation in response t...

متن کامل

Automating Feature Set Selection for Case-Based Learning of Linguistic Knowledge

This paper addresses the issue of "algorithm vs. representation" for case-based learning of linguistic knowledge. We first present empirical evidence that the success of case-based learning methods for natural language processing tasks depends to a large degree on the feature set used to describe the training instances. Next, we present a technique for automating feature set selection for case-...

متن کامل

Intrusion Detection based on a Novel Hybrid Learning Approach

Information security and Intrusion Detection System (IDS) plays a critical role in the Internet. IDS is an essential tool for detecting different kinds of attacks in a network and maintaining data integrity, confidentiality and system availability against possible threats. In this paper, a hybrid approach towards achieving high performance is proposed. In fact, the important goal of this paper ...

متن کامل

Online Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features

Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...

متن کامل

Fast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets

Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007